Frame-by-Frame Phoneme Classification Using MLP

Author

  • DOMOKOS JÓZSEF
Abstract

In this paper, we present practical experiments on frame-by-frame phoneme classification of continuous speech using Multi-Layer Perceptron (MLP) neural networks. We used the OASIS Numbers speech database to train and test our software application. In our experiments, we tried to classify all 32 phonemes in the OASIS Numbers database dictionary together. We also compared the results achieved with different MLP configurations. For classification, we used 13 MFCC coefficients and their first- and second-order derivatives (delta parameters), extracted from the speech signal by our Matlab-based feature extraction software.
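The feature layout described in the abstract (13 MFCCs plus first- and second-order deltas, giving 39 values per frame, classified into 32 phoneme classes) can be illustrated with a minimal Python/NumPy sketch. This is not the author's Matlab implementation. The regression-based delta formula, the window width, the hidden-layer size, the tanh activation, and all function names are assumptions chosen for illustration, and the random arrays stand in for real MFCC frames and trained weights.

```python
import numpy as np

def delta(features, width=2):
    """Regression-based delta over time (assumed formula).

    features: array of shape (frames, coeffs). Edges are padded by
    repeating the first and last frame.
    """
    num_frames = features.shape[0]
    padded = np.pad(features, ((width, width), (0, 0)), mode="edge")
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = np.zeros_like(features, dtype=float)
    for n in range(1, width + 1):
        ahead = padded[width + n: width + n + num_frames]
        behind = padded[width - n: width - n + num_frames]
        out += n * (ahead - behind)
    return out / denom

def stack_features(mfcc):
    """Append first- and second-order deltas: 13 -> 39 values per frame."""
    d1 = delta(mfcc)
    d2 = delta(d1)
    return np.hstack([mfcc, d1, d2])

def mlp_forward(x, w1, b1, w2, b2):
    """One-hidden-layer MLP with a softmax output, applied frame by frame."""
    hidden = np.tanh(x @ w1 + b1)
    logits = hidden @ w2 + b2
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    probs = np.exp(logits)
    return probs / probs.sum(axis=1, keepdims=True)

rng = np.random.default_rng(0)
mfcc = rng.normal(size=(100, 13))   # placeholder for real MFCC frames
x = stack_features(mfcc)            # shape (100, 39)

hidden_units = 64                   # hypothetical; the paper compares several configurations
num_phonemes = 32                   # from the abstract
w1 = rng.normal(scale=0.1, size=(x.shape[1], hidden_units))
b1 = np.zeros(hidden_units)
w2 = rng.normal(scale=0.1, size=(hidden_units, num_phonemes))
b2 = np.zeros(num_phonemes)

probs = mlp_forward(x, w1, b1, w2, b2)   # untrained weights: outputs are meaningless
labels = probs.argmax(axis=1)            # one phoneme index per frame
```

Each row of `probs` is a distribution over the 32 phoneme classes for one frame, so frame-by-frame classification reduces to taking the arg-max per row. Training the weights (for example with backpropagation) is outside the scope of this sketch.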


Similar resources

Phoneme segmentation of continuous speech using multi-layer perceptron

In this paper, we propose a new method of phoneme segmentation using an MLP (multi-layer perceptron). The structure of the proposed segmenter consists of three parts: a preprocessor, an MLP-based phoneme segmenter, and a postprocessor. The preprocessor utilizes a sequence of 44 order feature parameters for each frame of speech, based on acoustic-phonetic knowledge. The MLP has one hidden layer and an ...


A Continuous Speech Recognition System Embedding MLP into HMM

Nelson Morgan, Intl. Comp. Sc. Institute, 1947 Center Street, Suite 600, Berkeley, CA 94704, USA. We are developing a phoneme-based, speaker-dependent continuous speech recognition system embedding a Multilayer Perceptron (MLP) (i.e., a feedforward Artificial Neural Network) into a Hidden Markov Model (HMM) approach. In [Bourlard & Wellekens], it was shown that MLPs were approximating Maximum a Po...


Single frame selection for phoneme classification

Our former study [1] has shown that maximum likelihood (ML) based frame selection, which selects reliable frames from a high resolution along the time axis, helps to improve the discrimination between phonemes. In this paper, we present our recent research on single frame selection for a phoneme classification task. A new single selection, which only selects one frame for one state in a Hidden...


Parallel and hierarchical speech feature classification using frame and segment-based methods

Phonemes in the English language can be represented using either parallel or hierarchical distinctive speech features. There have been a number of efforts to integrate multiple information sources but none of these efforts addressed the issue of combining multiple sets of articulatory/linguistic features with different organization topologies. In this study, we combine a frame-based parallel sp...


Phoneme Recognition using Competitive Neural Trees

This paper applies the Competitive Neural Tree (CNeT) method to phoneme recognition, a pattern classification problem. CNeTs combine the advantages of Decision Trees and Competitive Neural Networks. The CNeT algorithm works by hierarchically clustering given examples while growing a tree. Different search methods, as well as stopping and splitting criteria, are discussed. The CNeT algorithm allows...




Publication date: 2008